Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control

نویسندگان

چکیده

Over the past few years, use of swarms Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become widespread thanks to price reduction increased capabilities drones. The drones swarm need cooperatively explore an unknown area, order identify monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales larger without modifications. proposed framework relies on possibility for UAVs exchange some information through communication channel, achieve context-awareness implicitly coordinate swarm's actions. Our experiments show method can yield effective strategies, which are robust channel impairments, easily deal with non-uniform distributions targets obstacles. Moreover, when agents trained specific scenario, they adapt new one minimal additional training. We also our achieves better performance compared computationally intensive look-ahead heuristic.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Reinforcement Learning for Motor Control

Artificial learners often require many more trials than humans or animals when learning motor control tasks in the absence of expert knowledge. We implement two key ingredients of biological learning systems, generalization and incorporation of uncertainty into the decision-making process, to speed up artificial learning. We present a coherent and fully Bayesian framework that allows for effici...

متن کامل

Efficient Distributed Reinforcement Learning Through Agreement

Distributed robotic systems can benefit from automatic controller design and online adaptation by reinforcement learning (RL), but often suffer from the limitations of partial observability. In this paper, we address the twin problems of limited local experience and locally observed but not necessarily telling reward signals encountered in such systems. We combine direct search in policy space ...

متن کامل

Efficient reinforcement learning: model-based Acrobot control

|Several methods have been proposed in the reinforcement learning literature for learning optimal policies for sequential decision tasks. Q-learning is a model-free algorithm that has recently been applied to the Acrobot, a two-link arm with a single actuator at the elbow that learns to swing its free endpoint above a target height. However, applying Q-learning to a real Acrobot may be impracti...

متن کامل

Guided Deep Reinforcement Learning for Swarm Systems

In this paper, we investigate how to learn to control a group of cooperative agents with limited sensing capabilities such as robot swarms. The agents have only very basic sensor capabilities, yet in a group they can accomplish sophisticated tasks, such as distributed assembly or search and rescue tasks. Learning a policy for a group of agents is difficult due to distributed partial observabili...

متن کامل

Autonomous UAV Navigation Using Reinforcement Learning

Unmanned aerial vehicles (UAV) are commonly used for missions in unknown environments, where an exact mathematical model of the environment may not be available. This paper provides a framework for using reinforcement learning to allow the UAV to navigate successfully in such environments. We conducted our simulation and real implementation to show how the UAVs can successfully learn to navigat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Cognitive Communications and Networking

سال: 2021

ISSN: ['2332-7731', '2372-2045']

DOI: https://doi.org/10.1109/tccn.2021.3063170